Vocabulary mismatch : English Wikipedia
Vocabulary mismatch

Vocabulary mismatch is a common phenomenon in the usage of natural languages, occurring when different people name the same thing or concept differently.
Furnas et al. (1987) were perhaps the first to study the vocabulary mismatch problem quantitatively.〔Furnas, G., et al., The Vocabulary Problem in Human-System Communication, Communications of the ACM, 1987, 30(11), pp. 964-971.〕 Their results show that, on average, 80% of the time different people (experts in the same field) will name the same thing differently. There are usually tens of possible names that can be attributed to the same thing. This research motivated the work on latent semantic indexing.
The vocabulary mismatch between user-created queries and relevant documents in a corpus causes the term mismatch problem in information retrieval. Zhao and Callan (2010)〔Zhao, L. and Callan, J., Term Necessity Prediction, Proceedings of the 19th ACM Conference on Information and Knowledge Management (CIKM 2010). Toronto, Canada, 2010.〕 were perhaps the first to study the vocabulary mismatch problem quantitatively in a retrieval setting. Their results show that an average query term fails to appear in 30-40% of the documents that are relevant to the user query. They also showed that this probability of mismatch is a central probability in one of the fundamental probabilistic retrieval models, the Binary Independence Model. They developed novel term weight prediction methods that can potentially lead to 50-80% accuracy gains in retrieval over strong keyword retrieval models. Further research along this line shows that expert users can use Boolean conjunctive normal form expansion to improve retrieval performance by 50-300% over unexpanded keyword queries.〔Zhao, L. and Callan, J., Automatic term mismatch diagnosis for selective query expansion, SIGIR 2012.〕
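The mismatch probability described above can be estimated from relevance judgments as the fraction of relevant documents that do not contain a given query term. The following is a minimal sketch of that estimate, not the method of the cited papers; the function name `term_mismatch_probabilities` and the toy documents are assumptions made for illustration.

```python
from typing import Dict, Iterable, List, Set


def term_mismatch_probabilities(
    query_terms: Iterable[str],
    relevant_docs: List[Set[str]],
) -> Dict[str, float]:
    """Estimate P(term absent | relevant) for each query term.

    Each relevant document is represented as a set of its terms.
    """
    if not relevant_docs:
        raise ValueError("need at least one relevant document")
    total = len(relevant_docs)
    result: Dict[str, float] = {}
    for term in query_terms:
        # Count the relevant documents that never mention the term.
        missing = sum(1 for doc in relevant_docs if term not in doc)
        result[term] = missing / total
    return result


# Toy relevance judgments, invented for this example.
relevant = [
    {"car", "engine", "repair"},
    {"automobile", "engine", "maintenance"},
    {"car", "motor", "service"},
    {"vehicle", "engine", "repair"},
]

mismatch = term_mismatch_probabilities(["car", "engine"], relevant)
print(mismatch)  # {'car': 0.5, 'engine': 0.25}
```

In this toy collection the term "car" is missing from half of the relevant documents because their authors wrote "automobile" or "vehicle" instead, which is the kind of mismatch the paragraph describes.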
== Techniques that solve mismatch ==

* Stemming
* Full-text indexing instead of only indexing keywords or abstracts
* Indexing text on inbound links from other documents (or other social tagging)
* Query expansion. A 2012 study by Zhao and Callan using expert-created manual conjunctive normal form queries has shown that searchonym expansion in Boolean conjunctive normal form is much more effective than traditional bag-of-words expansion, e.g. Rocchio expansion.
* Translation-based models
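
As an illustration of the query expansion item in the list above, a Boolean conjunctive normal form query can be modelled as an AND over groups of alternative terms, where each group is an OR over one concept's possible names. The sketch below is a simplified assumption of how such matching might work, not the procedure used in the cited study; the names `matches_cnf` and `search` and the sample documents are hypothetical.

```python
from typing import Dict, List, Set

# A CNF query is a list of OR-groups; all groups must be satisfied (AND).
CnfQuery = List[Set[str]]


def matches_cnf(query: CnfQuery, doc_terms: Set[str]) -> bool:
    """Return True if every OR-group shares at least one term with the document."""
    return all(group & doc_terms for group in query)


def search(query: CnfQuery, docs: Dict[str, Set[str]]) -> List[str]:
    """Return the sorted ids of all documents that satisfy the CNF query."""
    return sorted(doc_id for doc_id, terms in docs.items() if matches_cnf(query, terms))


# Toy collection, invented for this example.
docs = {
    "d1": {"car", "engine", "repair"},
    "d2": {"automobile", "engine", "maintenance"},
    "d3": {"vehicle", "insurance", "quote"},
    "d4": {"bicycle", "repair", "shop"},
}

# Unexpanded keyword query: car AND repair
unexpanded: CnfQuery = [{"car"}, {"repair"}]

# Expanded query: (car OR automobile OR vehicle) AND (repair OR maintenance OR service)
expanded: CnfQuery = [
    {"car", "automobile", "vehicle"},
    {"repair", "maintenance", "service"},
]

print(search(unexpanded, docs))  # ['d1']
print(search(expanded, docs))    # ['d1', 'd2']
```

Because each concept keeps its own group, the expansion recovers the document that says "automobile" and "maintenance" without admitting documents that mention only one of the two concepts, such as d3 or d4.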

Excerpt quoted from: Wikipedia, the free encyclopedia